Background


•  Vessels  report  themselves  through  various  means: 

-  Automatic  Identification  System  (AIS) 

-  Vessel  Monitoring  System  (VMS) 

-  Long  Range  Identification  &  Tracking  (LRIT) 

-  Weather  reports 

-  Pre-arrival  notifications  to  authorities 


-  etc. 

Self-reporting is increasingly becoming mandatory
(e.g., AIS is now required for all ships > 300 GT
engaged on international voyages).


A  significant  fraction  of  the  data  entering  the 
recognized  maritime  picture  is  now  coming  from 
self-reports. 
Issues


•  Self-reporting  systems  are  not  tamper-proof. 
Vessels  may  report  incorrect  information, 
deliberately  or  not. 


•  Self-reports  should  be  verified,  to  some  extent, 
using  other  information  sources. 

•  Various  means  of  verification:  aerial  patrols,  surface 
patrols,  imagery,  etc. 


Verifying  all  self-reports  is  impractical  and  not  cost- 
effective. 

Key  questions  from  the  operators: 

-  How  many  vessels  should  be  verified? 

-  Which  ones? 
How Many?


How  many  self-reporting  vessels  should  be  verified  in 
order  to  reach  a  certain  confidence  level  that  the 
information  received  from  all  unverified  vessels  is  correct? 


• To answer this, some prior estimate of the fraction $f$
of the vessel population that transmits incorrect
information is required.

• Analysis performed on AIS reports (Gauthier, 2006).
Estimated $f$ for three types of “errors”:

-  insignificant  (e.g.,  non-mandatory  field  missing); 

-  minor  (e.g.,  typo  in  ship  name);  and 

-  serious  (e.g.,  wrong  ship  location). 
How Many?

•  Let 


$(1 - f)$ = probability that each vessel reports correct
information;

$N_R$ = number of self-reporting vessels; and

$N_V$ = number of self-reporting vessels verified.


Assume  independence  between  vessels  and  perfect 
error  recognition. 

Also  assume  uniform  sampling  without  replacement 
(each  vessel  is  randomly  selected  and  verified  only 
once  during  a  patrol). 


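Probability $P$ that all unverified vessels report correct
information:

$$P = (1 - f)^{N_R - N_V}$$

Solving for the number of vessels to verify:

$$N_V = \frac{N_R \log(1 - f) - \log(P)}{\log(1 - f)} = N_R - \frac{\log(P)}{\log(1 - f)}$$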
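The relation between P, f, the number of reporting vessels and the number verified can be checked numerically. The following is a minimal sketch, assuming the model stated on this slide (independent vessels, perfect error recognition, each vessel verified at most once). The function names are illustrative and do not come from the original analysis.

```python
import math


def prob_all_unverified_correct(f: float, n_reporting: int, n_verified: int) -> float:
    """P = (1 - f) ** (N_R - N_V): all unverified vessels report correctly."""
    return (1.0 - f) ** (n_reporting - n_verified)


def vessels_to_verify(f: float, n_reporting: int, target_p: float) -> int:
    """Smallest N_V for which P >= target_p.

    Uses N_V = N_R - log(P) / log(1 - f), rounded up to a whole vessel
    and clamped to the range 0..N_R.
    """
    if not 0.0 < f < 1.0:
        raise ValueError("f must lie strictly between 0 and 1")
    if not 0.0 < target_p <= 1.0:
        raise ValueError("target_p must lie in (0, 1]")
    exact = n_reporting - math.log(target_p) / math.log(1.0 - f)
    return min(n_reporting, max(0, math.ceil(exact)))
```

For instance, with f = 0.004 and 100 reporting vessels, `vessels_to_verify(0.004, 100, 0.9)` returns 74: leaving 26 vessels unverified gives P of about 0.901, while leaving 27 unverified drops P just below 0.9.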
How Many?


Example: Probability that all unverified vessels report
correct information (with $N_R$ = 100 vessels reporting)
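A curve of this kind can be tabulated directly from the formula for P. This is a minimal sketch; the values of f used here are hypothetical and chosen for illustration only, since the values plotted in the original figure are not recoverable from the text.

```python
def probability_table(f_values, n_reporting=100, step=20):
    """Return rows (N_V, [P for each f]) with P = (1 - f) ** (N_R - N_V)."""
    rows = []
    for n_verified in range(0, n_reporting + 1, step):
        probs = [(1.0 - f) ** (n_reporting - n_verified) for f in f_values]
        rows.append((n_verified, probs))
    return rows


if __name__ == "__main__":
    f_values = [0.004, 0.01, 0.05]  # hypothetical error fractions
    print("N_V  " + "  ".join(f"f={f:<6}" for f in f_values))
    for n_verified, probs in probability_table(f_values):
        print(f"{n_verified:<4} " + "  ".join(f"{p:8.4f}" for p in probs))
```

Each column rises to 1 as the number of verified vessels approaches the number reporting, and rises more slowly at first for larger f.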
How Many?

Example of a simple spreadsheet-based estimation tool
for operators.
[Spreadsheet: Requirement for the verification of self-reporting vessels.
Input: average (historical) fraction of vessels NOT reporting correctly = 0.004.
Chart axis: number of self-reporting vessels verified.]
Which Ones?


•  Uniform  sampling  is  not  the  most  efficient  selection 
method. 

• What if a meaningful probability $f_i$ of reporting
incorrect information can be assigned to each
vessel $i = 1, \ldots, N_R$?

•  Optimal  verification  strategy: 

- Sort the $f_i$'s from largest to smallest value.

- Focus available time and resources on vessels
with the highest $f_i$'s.
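The sort-and-select strategy can be sketched as follows. This assumes, as a natural extension of the uniform case, that vessels are independent, so the probability that all unverified vessels report correctly is the product of (1 - f_i) over the unverified vessels. The function names and the sample values are hypothetical.

```python
import math


def select_highest_risk(f, n_verify):
    """Indices of the n_verify vessels with the largest f_i."""
    order = sorted(range(len(f)), key=lambda i: f[i], reverse=True)
    return order[:n_verify]


def prob_unverified_correct(f, verified):
    """Product of (1 - f_i) over vessels that were NOT verified."""
    verified = set(verified)
    return math.prod(1.0 - fi for i, fi in enumerate(f) if i not in verified)


if __name__ == "__main__":
    f = [0.01, 0.2, 0.05, 0.1]           # hypothetical per-vessel priors
    chosen = select_highest_risk(f, 2)   # the two highest-risk vessels
    print(chosen, prob_unverified_correct(f, chosen))
```

Verifying the vessels with the largest f_i removes the smallest factors from the product, which is why no other choice of the same size gives a higher probability under this model.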
Which Ones?

Example: Probability that all unverified vessels report
correct information (with $N_R$ = 100 vessels and fi=i-f/2)

[Figure; axis: number of vessels verified ($N_V$)]


Which  Ones? 

Case  with  replacement 


• Consider now a very large area monitored over a
very long period (e.g., 1 year).

•  A  vessel  may  then  be  verified  multiple  times  by 
multiple  patrols,  patrol  assets,  or  agencies.  This  is 
sampling  with  replacement. 


Does the optimal verification strategy remain the
same?
Which Ones?

Case  with  replacement 


•  No! 

• “Strong profiling is not mathematically optimal for
discovering rare malfeasors” (W. H. Press, Proceedings
of the National Academy of Sciences, Feb 2009).

• When sampling proportionally to the $f_i$'s (strong
profiling), resources are wasted on the repeated
verification of vessels that are correctly reporting.

•  Priors  should  be  used,  but  only  weakly. 
Which Ones?

Case  with  replacement 


•  Optimal  verification  strategy: 

- Estimate $f_i$ for every vessel.

- Perform square-root biased sampling
(proportional to $\sqrt{f_i}$) on the vessel population.

•  Example: 


-  If  a  vessel  is  a  priori  25  times  more  likely  than 
average  to  report  incorrect  information,  it 
should  be  verified  only  5  times  more 
frequently  than  average. 

Theoretically  valid  for  any  type  of  vessel  profiling, 
although  not  all  profiling  methods  may  be  legal  or 
feasible. 
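Square-root biased sampling can be compared with uniform sampling and strong profiling in a small sketch. The comparison assumes an idealised model in the spirit of the cited paper: a single incorrectly reporting vessel, which is vessel i with normalised prior p_i, and independent checks that each pick vessel i with probability q_i. Under that assumption the expected number of checks before the vessel is found is the sum of p_i / q_i. The function names and the population below are hypothetical.

```python
def sampling_weights(priors, exponent=0.5):
    """Sampling probabilities q_i proportional to priors_i ** exponent.

    exponent = 0.5: square-root biased sampling
    exponent = 1.0: strong profiling (proportional to the priors)
    exponent = 0.0: uniform sampling
    Priors must be strictly positive.
    """
    raw = [p ** exponent for p in priors]
    total = sum(raw)
    return [r / total for r in raw]


def expected_checks(priors, weights):
    """Expected checks, sum(p_i / q_i), under the single-offender assumption."""
    total = sum(priors)
    return sum((p / total) / q for p, q in zip(priors, weights))


if __name__ == "__main__":
    # Hypothetical population: one vessel 25 times more likely than the rest.
    priors = [0.025] + [0.001] * 99
    for name, exponent in [("uniform", 0.0), ("square-root", 0.5), ("strong", 1.0)]:
        q = sampling_weights(priors, exponent)
        print(name, round(q[0] / q[1], 3), round(expected_checks(priors, q), 2))
```

In this population the high-prior vessel is sampled 5 times as often as any other under square-root weighting. Uniform sampling and strong profiling both need 100 checks on average, while square-root biased sampling needs about 87.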
Operational Considerations


Self-report  verification  is  generally  not  the  primary 
objective  of  surveillance  activities. 

Even  if  verification  is  performed  as  a  secondary 
task,  operators  should: 

-  record  number  of  verifications  and  details  of 
incorrect  reports; 

-  record  reporting  vessel  characteristics;  and 

-  refine  priors  as  more  data  becomes  available. 
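One simple way to refine a prior from recorded verification counts is a Beta-Binomial update. This is a sketch of a standard technique, not a method stated in the presentation; the prior parameters and the counts in the example are hypothetical.

```python
def update_prior(alpha, beta, n_verified, n_incorrect):
    """Update a Beta(alpha, beta) prior on f with new verification results."""
    if n_verified < 0 or not 0 <= n_incorrect <= n_verified:
        raise ValueError("need 0 <= n_incorrect <= n_verified")
    return alpha + n_incorrect, beta + (n_verified - n_incorrect)


def posterior_mean(alpha, beta):
    """Mean of Beta(alpha, beta), used as the refined estimate of f."""
    return alpha / (alpha + beta)


if __name__ == "__main__":
    # Hypothetical: prior mean 0.004, then 500 verifications with 3 incorrect.
    alpha, beta = update_prior(1, 249, n_verified=500, n_incorrect=3)
    print(alpha, beta, posterior_mean(alpha, beta))
```

Keeping one (alpha, beta) pair per vessel category would let the recorded vessel characteristics feed category-level priors as records accumulate.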

In  practice,  sampling  involves  tradeoffs  between 
verification  effort  (time,  distance)  and  the  number  of 
vessels  verified. 


If priors can be assigned to different categories of
vessels, sampling should be kept roughly
proportional to $\sqrt{f_i}$.
Conclusion


• Simple statistical methods can be easily
implemented by authorities to assess the reliability
of self-reports.


•  More  sophisticated  Bayesian  procedures  could 
also  be  investigated  (e.g.,  procedures  based  on 
anomaly  detection). 


Good  estimates  of  the  priors  are  key  to  reliability 
assessments.  Verification  results  should  be 
compiled  and  analyzed  on  a  regular  basis  to  refine 
priors. 

Enforcing  regulations  more  stringently  would 
increase  self-report  reliability  in  the  medium  term. 
Verification  data  could  be  used  to  measure  the 
impact  and  cost-effectiveness  of  this  enforcement. 
Comments?

Questions? 